Abstract

In the problem Max Lin, we are given a system Az = b of m linear equations 
with n variables over F2 in which each equation is assigned a positive weight 
and we wish to find an assignment of values to the variables that maximizes the 
excess, which is the total weight of satisfied equations minus the total weight of 
falsified equations. Using an algebraic approach, we obtain a lower bound for 
the maximum excess. 

Max Lin Above Average (Max Lin AA) is a parameterized version of Max 
Lin introduced by Mahajan et al. (Proc. IWPEC'06 and J. Comput. Syst. Sci. 
75, 2009). In Max Lin AA all weights are integral and we are to decide whether 
the maximum excess is at least k, where k is the parameter. 

It is not hard to see that we may assume that no two equations in Az = b have the same left-hand side and n = rank A. Using our maximum excess results, we prove that, under these assumptions, Max Lin AA is fixed-parameter tractable for a wide special case: $m \le 2^{p(n)}$ for an arbitrary fixed function $p(n) = o(n)$. This result generalizes earlier results by Crowston et al. (arXiv:0911.5384) and Gutin et al. (Proc. IWPEC'09). We also prove that Max Lin AA is polynomial-time solvable for every fixed k and, moreover, Max Lin AA is in the parameterized complexity class W[P].

Max r-Lin AA is a special case of Max Lin AA, where each equation has at most r variables. In Max Exact r-SAT AA we are given a multiset of m clauses on n variables such that each clause has r variables and asked whether there is a truth assignment to the n variables that satisfies at least $(1 - 2^{-r})m + k2^{-r}$ clauses. Using our maximum excess results, we prove that for each fixed $r \ge 2$, Max r-Lin AA and Max Exact r-SAT AA can be solved in time $2^{O(k \log k)} + m^{O(1)}$. This improves the $2^{O(k^2)} + m^{O(1)}$-time algorithms for the two problems obtained by Gutin et al. (IWPEC 2009) and Alon et al. (SODA 2010), respectively.

It is easy to see that maximization of arbitrary pseudo-boolean functions, 
i.e., functions $f : \{-1,+1\}^n \to \mathbb{R}$ represented by their Fourier expansions,
is equivalent to solving Max Lin. Using our main maximum excess result, we 
obtain a tight lower bound on the maxima of pseudo-boolean functions.

1 Introduction

In the problem Max Lin, we are given a system Az = b of m linear equations in n variables over F2 in which each equation is assigned a positive weight and we wish to find an assignment of values to the variables in order to maximize the total weight of satisfied equations. A special case of Max Lin when each equation has at most r variables is called Max r-Lin.

Various algorithmic aspects of Max Lin have been well studied (cf. [2, 10, 11]). Perhaps the best known result on Max Lin is the following inapproximability theorem of Håstad [10]: unless P=NP, for each $\epsilon > 0$ there is no polynomial-time algorithm for distinguishing instances of Max 3-Lin in which at least $(1 - \epsilon)m$ equations can be simultaneously satisfied from instances in which fewer than $(1/2 + \epsilon)m$ equations can be simultaneously satisfied.

Notice that maximizing the total weight of satisfied equations is equivalent to 
maximizing the excess, which is the total weight of satisfied equations minus the 
total weight of falsified equations. In Section 2 we investigate lower bounds for the maximum excess. Using an algebraic approach, we prove the following main result: Let Az = b be a Max Lin system such that rank A = n and no pair of equations has the same left-hand side, let $w_{\min}$ be the minimum weight of an equation in Az = b, and let $k \ge 2$. If $k \le m \le 2^{n/(k-1)} - 2$, then the maximum excess of Az = b is at least $k \cdot w_{\min}$. Moreover, we can find an assignment that achieves an excess of at least $k \cdot w_{\min}$ in time $m^{O(1)}$.
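The notion of excess can be made concrete with a small brute-force sketch. The representation of an equation as a triple `(S, b, w)` and the function names are assumptions made for illustration only; the exhaustive search is not the polynomial-time method of the main result.

```python
from itertools import product

def excess(equations, z):
    """Total weight of satisfied equations minus total weight of falsified ones.

    equations: list of (S, b, w), read as sum_{i in S} z_i = b over F_2 with weight w > 0.
    z: sequence of 0/1 values indexed by variable.
    """
    total = 0
    for S, b, w in equations:
        satisfied = sum(z[i] for i in S) % 2 == b
        total += w if satisfied else -w
    return total

def max_excess(equations, n):
    """Exhaustive search over all 2^n assignments (only sensible for tiny n)."""
    return max(excess(equations, z) for z in product((0, 1), repeat=n))

# Hypothetical toy system: z0 = 0, z1 = 0, z0 + z1 = 1, all of weight 1.
toy = [({0}, 0, 1), ({1}, 0, 1), ({0, 1}, 1, 1)]
```

For the toy system no assignment satisfies all three equations, so the best one can do is satisfy two and falsify one.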

Using this and other results of Section 2, we prove the parameterized complexity results of Section 3. To describe these results we need the following notions, most of which can be found in the monographs [6, 7, 15].

A parameterized problem is a subset $L \subseteq \Sigma^* \times \mathbb{N}$ over a finite alphabet $\Sigma$. L is fixed-parameter tractable if membership in L of an instance $(x, k) \in \Sigma^* \times \mathbb{N}$ can be decided in time $f(k)|x|^{O(1)}$, where f is a computable function of the parameter k. When the decision time is replaced by the much more powerful $|x|^{O(f(k))}$, we obtain the class XP, where each problem is polynomial-time solvable for any fixed value of k. There is an infinite number of parameterized complexity classes between FPT and XP (for each integer $t \ge 1$, there is a class W[t]) and they form the following tower:

$$\mathrm{FPT} \subseteq W[1] \subseteq W[2] \subseteq \cdots \subseteq W[P] \subseteq \mathrm{XP}.$$

Here W[P] is the class of all parameterized problems (x, k) that can be decided in $f(k)|x|^{O(1)}$ time by a nondeterministic Turing machine that makes at most $f(k) \log |x|$ nondeterministic steps for some computable function f. For the definition of classes W[t], see, e.g., [7] (we do not use these classes in the rest of the paper).

Given a pair L, L' of parameterized problems, a bikernelization from L to L' is a polynomial-time algorithm that maps an instance (x, k) to an instance (x', k') (the bikernel) such that (i) $(x, k) \in L$ if and only if $(x', k') \in L'$, (ii) $k' \le f(k)$, and (iii) $|x'| \le g(k)$ for some functions f and g. The function g(k) is called the size of the bikernel. The notion of a bikernelization was introduced in [1], where it was observed that a parameterized problem L is fixed-parameter tractable if and only if it is decidable and admits a bikernelization from itself to a parameterized problem L'. A kernelization of a parameterized problem L is simply a bikernelization from L to itself; the bikernel is the kernel, and g(k) is the size of the kernel. Due to applications, low degree polynomial size kernels are of main interest.

Note that W/2 is a tight lower bound on the maximum weight of satisfiable equations in a Max Lin system Az = b. Indeed, W/2 is the average weight of satisfied equations (as the probability of each equation to be satisfied is 1/2) and, thus, is a lower bound; to see the tightness consider a system of pairs of equations of the form $\sum_{i \in I} z_i = 0$, $\sum_{i \in I} z_i = 1$ of weight 1. Mahajan et al. [13, 14] parameterized Max Lin as follows: given a Max Lin system Az = b, decide whether the total weight of satisfied equations minus W/2 is at least k', where W is the total weight of all equations and k' is the parameter. This is equivalent to asking whether the maximum excess is at least k, where k = 2k' is the parameter. (Note that since k = 2k', these two questions are equivalent from the complexity point of view.) Since W/2 is the average weight of satisfied equations, we will call the parameterized Max Lin problem Max Lin Above Average or Max Lin AA. Since the parameter k is more convenient for us to use, in what follows we use the version of Max Lin AA parameterized by k.
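The equivalence of the two parameterizations is a one-line computation. In the sketch below, $W_{\mathrm{sat}}$ is a symbol introduced here only for illustration; it denotes the total weight of equations satisfied by a fixed assignment.

```latex
\begin{align*}
\text{excess} &= W_{\mathrm{sat}} - (W - W_{\mathrm{sat}})
               = 2\left(W_{\mathrm{sat}} - \tfrac{W}{2}\right),\\
W_{\mathrm{sat}} - \tfrac{W}{2} \ge k' &\iff \text{excess} \ge 2k' = k .
\end{align*}
```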

Mahajan et al. [13, 14] raised the question of determining the parameterized complexity of Max Lin AA. It is not hard to see (we explain it in detail in Section 2) that we may assume that no two equations in Az = b have the same left-hand side and n = rank A. Using our maximum excess results, we prove that, under these assumptions, (a) Max Lin AA is fixed-parameter tractable if $m \le 2^{p(n)}$ for an arbitrary fixed function $p(n) = o(n)$, and (b) Max Lin AA has a polynomial-size kernel if $m \le 2^{n^a}$ for an arbitrary a < 1. We conjecture that under the two assumptions if $m < 2^{an}$ for some constant a > 0, then Max Lin AA is W[1]-hard, i.e., result (a) is best possible in a sense. In addition, we prove that Max Lin AA is in XP (thus, Max Lin AA is polynomial-time solvable for every fixed k), and, moreover, it is in W[P].

Recall that Max r-Lin AA is a special case of Max Lin AA, where each equation has at most r variables. In Max Exact r-SAT AA we are given a multiset of m clauses on n variables such that each clause has r variables and asked whether there is a truth assignment to the n variables that satisfies at least $(1 - 2^{-r})m + k2^{-r}$ clauses. Using our maximum excess results, we prove that for each fixed $r \ge 2$ Max r-Lin AA has a kernel with $O(k \log k)$ variables and, thus, it can be solved in time $2^{O(k \log k)} + m^{O(1)}$. This improves a kernel with $O(k^2)$ variables for Max r-Lin AA obtained by Gutin et al. [8]. Similarly, we prove that for each $r \ge 2$ Max Exact r-SAT AA has a kernel with $O(k \log k)$ variables and it can be solved in time $2^{O(k \log k)} + m^{O(1)}$, improving a kernel with $O(k^2)$ variables for Max Exact r-SAT AA obtained by Alon et al. [1]. Note that while the kernels with $O(k^2)$ variables were obtained using a probabilistic approach, our results are obtained using an algebraic approach. Using a graph-theoretical approach Alon et al. [1] obtained a kernel of Max Exact 2-SAT AA with O(k) variables, but it is unlikely that their approach can be extended beyond r = 2.

Fourier analysis of pseudo-boolean functions, i.e., functions $f : \{-1,+1\}^n \to \mathbb{R}$, has been used in many areas of computer science (cf. [1, 16, 17]). In Fourier analysis, the Boolean domain is often assumed to be $\{-1,+1\}^n$ rather than the more usual $\{0,1\}^n$ and we will follow this assumption in our paper. Here we use the following well-known and easy to prove fact [16] that each function $f : \{-1,+1\}^n \to \mathbb{R}$ can be uniquely written as

$$f(x) = \sum_{S \subseteq [n]} c_S \prod_{i \in S} x_i, \qquad (1)$$

where $[n] = \{1, 2, \ldots, n\}$ and each $c_S$ is a real. Formula (1) is the Fourier expansion of f, $c_S$ are the Fourier coefficients of f (in the literature, these coefficients are often denoted by $\hat{f}(S)$ and we will use both $c_S$ and $\hat{f}(S)$ interchangeably), and the monomials $\prod_{i \in S} x_i$ form an orthogonal basis of (1) (thus, the monomials are often written as $\chi_S(x)$ but we will use only $\prod_{i \in S} x_i$ as it is more transparent).

Optimization of pseudo-boolean functions is useful in many areas including computer science, discrete mathematics, operations research, statistical mechanics and manufacturing; for many results and applications of pseudo-boolean function optimization, see a well-cited survey [3]. In classical analysis, there is a large number of lower bounds on the maxima of trigonometric Fourier expansions, cf. [4]. In Section 3 we prove a sharp lower bound on the maximum of a pseudo-boolean function using its Fourier expansion. The bound can be used in algorithmics, e.g., for approximation algorithms.

2 Results on Maximum Excess 

Consider two reduction rules for Max Lin introduced in [5] for Max Lin AA. These rules are of interest due to Lemma 1.

Reduction Rule 1. Let t = rank A and let columns $a^{i_1}, \ldots, a^{i_t}$ of A be linearly independent. Then delete all variables not in $\{z_{i_1}, \ldots, z_{i_t}\}$ from the equations of Az = b.

Reduction Rule 2. If we have, for a subset S of [n], an equation $\sum_{i \in S} z_i = b'$ with weight w', and an equation $\sum_{i \in S} z_i = b''$ with weight w'', then we replace this pair by one of these equations with weight w' + w'' if b' = b'' and, otherwise, by the equation whose weight is bigger, modifying its new weight to be the difference of the two old ones. If the resulting weight is 0, we delete the equation from the system.
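Reduction Rule 2 can be sketched compactly by storing a signed weight per left-hand side. The triple representation `(S, b, w)` and the name `apply_rule2` are illustrative assumptions; the signed-weight bookkeeping is one convenient way to apply the rule repeatedly, not a construction taken from the text.

```python
def apply_rule2(equations):
    """Merge equations with identical left-hand sides (Reduction Rule 2).

    equations: iterable of (S, b, w). Returns a list of (frozenset, b, w).
    """
    signed = {}  # left-hand side -> signed weight (+w for b = 0, -w for b = 1)
    for S, b, w in equations:
        key = frozenset(S)
        signed[key] = signed.get(key, 0) + (w if b == 0 else -w)
    # A zero signed weight means the equations cancelled and are deleted.
    return [(S, 0 if c > 0 else 1, abs(c)) for S, c in signed.items() if c != 0]
```

Equal right-hand sides add their weights, opposite right-hand sides leave the heavier equation with the difference, and a tie removes both.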

Lemma 1. Let A'z = b' be obtained from Az = b by Rule 1 or 2. Then the maximum excess of A'z = b' is equal to the maximum excess of Az = b. Moreover, A'z = b' can be obtained from Az = b in time polynomial in n and m.

To see the validity of Rule 1, consider an independent set I of columns of A of cardinality rank A and a column $a^j \notin I$. Observe that $a^j = \sum_{i \in I'} a^i$, where $I' \subseteq I$. Consider an assignment $z = z^0$. If $z^0_j = 1$ then for each $i \in I' \cup \{j\}$ replace $z^0_i$ by $z^0_i + 1$. The new assignment satisfies exactly the same equations as the initial assignment. Thus, we may assume that $z_j = 0$ and remove $z_j$ from the system. For a different proof, see [5]. If we cannot change a weighted system Az = b using Rules 1 and 2, we call it irreducible.
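A minimal sketch of Reduction Rule 1, assuming equations are given as triples `(S, b, w)` and that columns of A are encoded as integer bitmasks over the rows (both choices, and the function names, are illustrative). A maximal independent set of columns is selected by Gaussian elimination over F2 and all other variables are dropped.

```python
def independent_columns(supports, n):
    """Indices of a maximal linearly independent set of columns of A over F_2.

    supports: list of variable-index sets, one per equation (row of A).
    """
    # Column j as a bitmask: bit i is set when variable j occurs in equation i.
    cols = [sum(1 << i for i, S in enumerate(supports) if j in S) for j in range(n)]
    basis = {}   # leading bit -> reduced column with that leading bit
    chosen = []
    for j, v in enumerate(cols):
        while v:
            lead = v.bit_length() - 1
            if lead not in basis:
                basis[lead] = v      # column j is independent of earlier ones
                chosen.append(j)
                break
            v ^= basis[lead]         # eliminate the leading bit
    return chosen

def apply_rule1(equations, n):
    """Delete every variable outside the chosen independent columns."""
    keep = set(independent_columns([S for S, _, _ in equations], n))
    return [(frozenset(S) & keep, b, w) for S, b, w in equations]
```

In the example used for testing, the columns of variables 0 and 1 coincide, so variable 1 is removed.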

Consider the following algorithm that tries to maximize the total weight of satisfied 
equations of Az = b. We assume that, in the beginning, no equation or variable in 
Az = b is marked.

Algorithm H

While the system Az = b is nonempty do the following:

1. Choose an arbitrary equation $\sum_{i \in S} z_i = b$ and mark $z_l$, where $l = \min\{i : i \in S\}$.

2. Mark this equation and delete it from the system.

3. Replace every equation $\sum_{i \in S'} z_i = b'$ in the system containing $z_l$ by $\sum_{i \in S} z_i + \sum_{i \in S'} z_i = b + b'$.

4. Apply Reduction Rule 2 to the system.

Note that algorithm H replaces Az = b with an equivalent system under the assumption that the marked equations are satisfied; that is, for every assignment of values to the variables $z_1, \ldots, z_n$ that satisfies the marked equations, both systems have the same excess.

The maximum H-excess of Az = b is the maximum possible total weight of equations marked by H for Az = b taken over all possible choices in Step 1 of H.
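The steps of Algorithm H can be sketched as follows. This is a hedged illustration: Step 3 is read here as adding the marked equation to every remaining equation that contains the marked variable (an assumption about the step's intent), equations are triples `(S, b, w)` with nonempty `S`, and the `choose` callback standing for the choice in Step 1 is a hypothetical parameter.

```python
def algorithm_h(equations, choose=None):
    """Run Algorithm H; return the marked equations in the order they were marked."""
    # Signed weight: +w encodes right-hand side 0, -w encodes right-hand side 1.
    system = {}
    for S, b, w in equations:
        key = frozenset(S)
        system[key] = system.get(key, 0) + (w if b == 0 else -w)
    system = {S: c for S, c in system.items() if c != 0}
    if choose is None:
        # Deterministic default: lexicographically smallest left-hand side.
        choose = lambda current: min(current, key=sorted)
    marked = []
    while system:
        S = choose(system)                 # Step 1: pick an equation
        c = system.pop(S)                  # Step 2: mark it and delete it
        marked.append((S, 0 if c > 0 else 1, abs(c)))
        l = min(S)                         # the marked variable z_l
        sign = 1 if c > 0 else -1
        updated = {}
        for T, d in system.items():
            if l in T:                     # Step 3: add the marked equation
                T, d = T ^ S, d * sign
            updated[T] = updated.get(T, 0) + d   # Step 4: Rule 2 merges equal sides
        system = {T: d for T, d in updated.items() if d != 0}
    return marked
```

The total weight of the returned equations is the H-excess obtained for this particular sequence of choices; different `choose` callbacks may give different totals.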

Lemma 2. The maximum excess of Az = b equals its maximum H-excess.

Proof. We first prove that the maximum excess of Az = b is not smaller than its maximum H-excess.

Let K be the set of equations marked by H. A method first described in [5] 
can find an assignment of values to the variables such that the equations in K are 
satisfied and, in the remainder of the system, the total weight of satisfied equations 
is not smaller than the total weight of falsified equations. 

For the sake of completeness, we repeat the description here. By construction, for 
any assignment that satisfies all the marked equations, exactly half of the non-marked 
equations are satisfied. Therefore it suffices to find an assignment to the variables such 
that all marked equations are satisfied. This is possible if we find an assignment that 
satisfies the last marked equation, then find an assignment satisfying the equation 
marked before the last, etc. Indeed, the equation marked before the last contains a 
(marked) variable $z_l$ not appearing in the last equation, etc. This proves the first part
of our lemma. 

Now we prove that the maximum H-excess of Az = b is not smaller than its 
maximum excess. Let $z = (z_1, \ldots, z_n)$ be an assignment that achieves the maximum
excess, t. Observe that if at each iteration of H we mark an equation that is satisfied 
by z, then H will mark equations of total weight t. □ 
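The back-substitution used in the first part of the proof can be sketched as below. It assumes the marked equations are given in the order they were marked and in the form they had when marked, as triples `(S, b, w)`; the function name and the sample data are hypothetical.

```python
def assignment_from_marked(marked, n):
    """Satisfy the marked equations from the last one back to the first."""
    z = [0] * n
    for S, b, _ in reversed(marked):
        l = min(S)  # marked variable; it does not occur in later marked equations
        z[l] = (b + sum(z[i] for i in S if i != l)) % 2
    return z

# Hypothetical run of Algorithm H on z0+z1=1 (w=2), z1+z2=0 (w=3), z0+z2=1 (w=1):
# it marks z0+z1=1 first and then the merged equation z1+z2=0 of weight 4.
marked = [({0, 1}, 1, 2), ({1, 2}, 0, 4)]
z = assignment_from_marked(marked, 3)
```

Because each marked variable was eliminated from all later equations, fixing it last cannot spoil an equation that has already been satisfied.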

Remark 1. It follows from Lemma 2 that the maximum excess of a (nonempty) irreducible system Az = b with smallest weight $w_{\min}$ is at least $w_{\min}$. If all weights are integral, then the maximum excess of Az = b is at least 1.

Clearly, the total weight of equations marked by H depends on the choice of equations to mark in Step 1. Below we consider one such choice based on the following theorem. The theorem allows us to find a set of equations such that we can mark each equation in the set in successive iterations of H. This means we can run H a guaranteed number of times, which we can use to get a lower bound on the H-excess.

Theorem 1. Let M be a set in $\mathbb{F}_2^n$ such that M contains a basis of $\mathbb{F}_2^n$, the zero vector is in M and $|M| < 2^n$. If k is a positive integer and $k + 1 \le |M| \le 2^{n/k}$ then, in time $|M|^{O(1)}$, we can find a subset K of M of k + 1 vectors such that no sum of two or more vectors of K is in M.

Proof. We first consider the case when k = 1. Since $|M| < 2^n$ and the zero vector is in M, there is a non-zero vector $v \notin M$. Since M contains a basis for $\mathbb{F}_2^n$, v can be written as a sum of vectors in M and consider such a sum with the minimum number of summands: $v = u_1 + \cdots + u_\ell$, $\ell \ge 2$. Since $u_1 + u_2 \notin M$ (otherwise the sum could be shortened), we may set $K = \{u_1, u_2\}$. We can find such a set K in polynomial time by looking at every pair in M × M.

We now assume that k > 1. Since $k + 1 \le |M| \le 2^{n/k}$ we have $n \ge k + 1$.

We proceed with a greedy algorithm that tries to find K. Suppose we have a set $L = \{a_1, \ldots, a_l\}$ of vectors in M, $l \le k$, such that no sum of two or more elements of L is in M. We can extend this set to a basis, so $a_1 = (1, 0, 0, \ldots, 0)$, $a_2 = (0, 1, 0, \ldots, 0)$ and so on. For every $a \in M \setminus L$ we check whether $M \setminus \{a_1, \ldots, a_l, a\}$ has an element that agrees with a in all co-ordinates $l + 1, \ldots, n$. If no such element exists, then we add a to the set L, as no element in M can be expressed as a sum of a and a subset of L.

If our greedy algorithm finds a set L of size at least k + 1, we are done and L is our set K. Otherwise, we have stopped at $l \le k$. In this case, we do the next iteration as follows. Recall that L is part of a basis of M such that $a_1 = (1, 0, 0, \ldots, 0)$, $a_2 = (0, 1, 0, \ldots, 0)$, .... We create a new set M' in $\mathbb{F}_2^{n'}$, where n' = n - l. We do this$^1$ by removing the first l co-ordinates from M, and then identifying together any vectors that agree in the remaining n' co-ordinates. We are in effect identifying together any vectors that only differ by a sum of some elements in L. It follows that every element of M' was created by identifying together at least two elements of M, since otherwise we would have had an element in $M \setminus L$ that should have been added to L by our greedy algorithm. Therefore it follows that $|M'| \le |M|/2 \le 2^{n/k - 1}$. From this inequality and the fact that $n' \ge n - k$, we get that $|M'| \le 2^{n'/k}$. It also follows by construction of M' that M' has a basis for $\mathbb{F}_2^{n'}$, and that the zero vector is in M'. (Thus, we have $|M'| \ge n' + 1$.) If $n' \ge k + 1$ we complete this iteration by running the algorithm on the set M' as in the first iteration. Otherwise ($n' \le k$), the algorithm stops.

Since each iteration of the algorithm decreases n', the algorithm terminates. Now we prove that at some iteration, the algorithm will actually find a set K of k + 1 vectors. To show this it suffices to prove that we will never reach the point when $n' \le k$. Suppose this is not true and we obtained $n' \le k$. Observe that $n' \ge 1$ (before that we had $n' \ge k + 1$ and we decreased n' by at most k) and $|M'| \ge n' + 1$. Since $|M'| \le 2^{n'/k}$, we have $n' + 1 \le 2^{n'/k}$, which is impossible due to $n' \le k$ unless n' = 1 and k = 1, a contradiction with the assumption that k > 1.

It is easy to check that the running time of the algorithm is polynomial in |M|. □
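To make the statement of Theorem 1 concrete, here is a brute-force sketch that searches for a set K with the required property. It is not the greedy algorithm of the proof: it simply tries all subsets of the requested size. Vectors of $\mathbb{F}_2^n$ are encoded as integer bitmasks, and the name `find_K` is an illustrative assumption.

```python
from functools import reduce
from itertools import combinations
from operator import xor

def find_K(M, size):
    """Return `size` vectors of M such that no sum of two or more of them is in M.

    M: collection of integers used as bitmasks over F_2. Returns None if no such set exists.
    """
    M = set(M)
    for K in combinations(sorted(M), size):
        # All sums (XORs) of two or more members of the candidate set.
        sums = (reduce(xor, sub)
                for r in range(2, size + 1)
                for sub in combinations(K, r))
        if all(s not in M for s in sums):
            return K
    return None
```

With M consisting of the zero vector and the standard basis of $\mathbb{F}_2^4$, a pair is found immediately; when M is all of $\mathbb{F}_2^2$ (which violates $|M| < 2^n$) no pair exists.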

Remark 2. It is much easier to prove a non-constructive version of the above result. In fact we can give a non-constructive proof that $k + 1 \le |M| \le 2^{n/k}$ can be replaced by $2k \le |M| < 2^{n/k}((k-1)!)^{1/k}$. We will extend our proof above for the case k = 1. We may assume that $k \ge 2$. Observe that the number of vectors of $\mathbb{F}_2^n$ that can be expressed as the sum of at most k vectors of M is at most

$$\binom{|M|}{k} + \binom{|M|}{k-1} + \cdots + \binom{|M|}{1} + 1 \le \frac{|M|^k}{(k-1)!}, \quad \text{for } |M| \ge 2k.$$

Since $|M| < 2^{n/k}((k-1)!)^{1/k}$ we have $|\mathbb{F}_2^n| > |M|^k/(k-1)!$ and, thus, at least for one vector a of $\mathbb{F}_2^n$ we have $a = m_1 + \cdots + m_\ell$, where $\ell$ is minimum and $\ell > k$. Note that, by the minimality of $\ell$, no sum of two or more summands of the sum for a is in M and all summands are distinct. Thus, we can set $K = \{m_1, \ldots, m_{k+1}\}$.

$^1$ For the reader familiar with vector space terminology: $\mathbb{F}_2^{n'}$ is $\mathbb{F}_2^n$ modulo span(L), the subspace of $\mathbb{F}_2^n$ spanned by L, and M' is the image of M in $\mathbb{F}_2^{n'}$.

Theorem 2. Let Az = b be an irreducible system, let $w_{\min}$ be the minimum weight of an equation in Az = b, and let $k \ge 2$. If $k \le m \le 2^{n/(k-1)} - 2$, then the maximum excess of Az = b is at least $k \cdot w_{\min}$. Moreover, we can find an assignment that achieves an excess of at least $k \cdot w_{\min}$ in time $m^{O(1)}$.

Proof. Consider a set M of vectors in $\mathbb{F}_2^n$ corresponding to equations in Az = b as follows: for each $\sum_{i \in S} z_i = b_S$ in Az = b, the vector $v = (v_1, \ldots, v_n) \in M$, where $v_i = 1$ if $i \in S$ and $v_i = 0$, otherwise. Add the zero vector to M. As Az = b is reduced by Rule 1 and $k \le m \le 2^{n/(k-1)} - 2$, we have that M contains a basis for $\mathbb{F}_2^n$ and $k \le |M| \le 2^{n/(k-1)} - 1$. Therefore, using Theorem 1 we can find a set K of k vectors such that no sum of two or more vectors in K belongs to M.

Now run Algorithm H choosing at each Step 1 an equation of Az = b corresponding to a member of K, then equations picked at random until the algorithm terminates. Algorithm H will run at least k iterations as no equation corresponding to a vector in K will be deleted before it has been marked. Indeed, suppose that this is not true. Then there are vectors $w \in K$ and $v \in M$ and a pair of nonintersecting subsets K' and K'' of $K \setminus \{v, w\}$ such that $w + \sum_{u \in K'} u = v + \sum_{u \in K''} u$. Thus, $v = w + \sum_{u \in K' \cup K''} u$, a contradiction with the definition of K.

In fact, the above argument shows that no equation of Az = b corresponding to a member of K will change its weight during the first k iterations of H. Thus, by Lemma 2, the maximum excess of Az = b is at least $k \cdot w_{\min}$. It remains to observe that we can once again use the algorithm given in the proof of Lemma 2 to find an assignment that gives an excess of at least $k \cdot w_{\min}$. □

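We now provide a useful association between weighted systems of linear equations on $\mathbb{F}_2^n$ and Fourier expansions of functions $f : \{-1,+1\}^n \to \mathbb{R}$. Let us rewrite (1), the Fourier expansion of such a function, as

$$f(x) = \hat{f}(\emptyset) + \sum_{S \in \mathcal{F}} c_S \prod_{i \in S} x_i, \qquad (2)$$

where $\mathcal{F} = \{\emptyset \ne S \subseteq [n] : c_S \ne 0\}$.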

Now associate the polynomial $\sum_{S \in \mathcal{F}} c_S \prod_{i \in S} x_i$ in (2) with a weighted system Az = b of linear equations on $\mathbb{F}_2^n$: for each $S \in \mathcal{F}$, we have an equation $\sum_{i \in S} z_i = b_S$ with weight $|c_S|$, where $b_S = 0$ if $c_S$ is positive and $b_S = 1$, otherwise. Conversely, suppose we have a system Az = b of linear equations on $\mathbb{F}_2^n$ in which each equation $\sum_{i \in S} z_i = b_S$ is assigned a weight $w_S > 0$ and no pair of equations have the same left-hand side. This system can be associated with the polynomial $\sum_{S \in \mathcal{F}} c_S \prod_{i \in S} x_i$, where $c_S = w_S$, if $b_S = 0$, and $c_S = -w_S$, otherwise. The above associations provide a bijection between Fourier expansions of functions $f : \{-1,+1\}^n \to \mathbb{R}$ with $\hat{f}(\emptyset) = 0$ and weighted systems of linear equations on $\mathbb{F}_2^n$. This bijection is of interest due to the following:

Proposition 1. An assignment $z^{(0)} = (z^{(0)}_1, \ldots, z^{(0)}_n)$ of values to the variables of Az = b maximizes the total weight of satisfied equations of Az = b if and only if $x^{(0)} = ((-1)^{z^{(0)}_1}, \ldots, (-1)^{z^{(0)}_n})$ maximizes f(x). Moreover, $\max_{x \in \{-1,+1\}^n} f(x) - \hat{f}(\emptyset)$ equals the maximum excess of Az = b.

Proof. The claims of this proposition easily follow from the fact that an equation $\sum_{i \in S} z_i = 0$ is satisfied if and only if $\prod_{i \in S} x_i > 0$, where $x_i = (-1)^{z_i}$. □
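The correspondence behind Proposition 1 can be checked numerically on a small example. In this sketch the dictionary of Fourier coefficients, the function names and the sample coefficients are all illustrative assumptions, and the constant term $\hat{f}(\emptyset)$ is taken to be 0.

```python
from itertools import product
from math import prod

def system_from_fourier(coeffs):
    """coeffs: dict frozenset(S) -> c_S for nonempty S. Returns equations (S, b_S, |c_S|)."""
    return [(S, 0 if c > 0 else 1, abs(c)) for S, c in coeffs.items() if c != 0]

def f_value(coeffs, x):
    """Evaluate sum_S c_S * prod_{i in S} x_i at a point x of {-1, +1}^n."""
    return sum(c * prod(x[i] for i in S) for S, c in coeffs.items())

def excess(equations, z):
    """Weight of satisfied equations minus weight of falsified equations."""
    return sum(w if sum(z[i] for i in S) % 2 == b else -w for S, b, w in equations)

# Hypothetical example: f(x) = 3*x0 - 4*x0*x1 + x1.
coeffs = {frozenset({0}): 3, frozenset({0, 1}): -4, frozenset({1}): 1}
system = system_from_fourier(coeffs)
best_f = max(f_value(coeffs, x) for x in product((1, -1), repeat=2))
best_excess = max(excess(system, z) for z in product((0, 1), repeat=2))
```

Under the substitution $x_i = (-1)^{z_i}$ the value of f and the excess agree at every point, not only at the maximum.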

3 Corollaries 

This section contains a collection of corollaries of Theorem 2 establishing parameterized complexity of special cases of Max Lin AA, of Max Exact r-SAT, and of a wide class of constraint satisfaction problems. In addition, we will prove that Max Lin AA is in W[P] and obtain a sharp lower bound on the maximum of a pseudo-boolean function.

3.1 Parameterized Complexity of Max Lin AA

Corollary 1. Let p(n) be a fixed function such that p(n) = o(n). If $m \le 2^{p(n)}$ then Max Lin AA is fixed-parameter tractable. Moreover, a satisfying assignment can be found in time $g(k)m^{O(1)}$ for some computable function g.

Proof. We may assume that $m \ge n \ge k > 1$. Observe that $m \le 2^{n/k}$ implies $m \le 2^{n/(k-1)} - 2$. Thus, by Theorem 2, if $p(n) \le n/k$, the answer to Max Lin AA is YES, and there is a polynomial algorithm to find a suitable assignment. Otherwise, $n \le f(k)$ for some function f dependent on k only and Max Lin AA can be solved in time $2^{f(k)} m^{O(1)}$ by checking every possible assignment. □
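The two-case structure of this proof can be sketched as a decision procedure. This is a simplification made for illustration: it tests the hypothesis of Theorem 2 directly (in exact integer arithmetic) instead of comparing p(n) with n/k, and it assumes an irreducible system with positive integral weights given as triples `(S, b, w)`. The name `max_lin_aa` is hypothetical.

```python
from itertools import product

def max_lin_aa(equations, n, k):
    """Decide whether the maximum excess of an irreducible integral system is at least k."""
    m = len(equations)
    # k <= m <= 2^{n/(k-1)} - 2 is equivalent to (m + 2)^(k-1) <= 2^n for k >= 2.
    if 2 <= k <= m and (m + 2) ** (k - 1) <= 2 ** n:
        return True   # Theorem 2 guarantees excess >= k * w_min >= k
    # Otherwise fall back to checking every assignment.
    def excess(z):
        return sum(w if sum(z[i] for i in S) % 2 == b else -w for S, b, w in equations)
    return max(excess(z) for z in product((0, 1), repeat=n)) >= k
```

The fallback is exponential in n, which is acceptable in the proof only because n is bounded by a function of k in that case.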

Let $\rho_i$ be the number of equations in Az = b containing $z_i$, $i = 1, \ldots, n$. Let $\rho = \max_{i \in [n]} \rho_i$ and let r be the maximum number of variables in an equation of Az = b. Crowston et al. [5] proved that Max Lin AA is fixed-parameter tractable if either $r \le r(n)$ for some fixed function r(n) = o(n) or $\rho \le \rho(m)$ for some fixed function $\rho(m) = o(m)$.

For a given r = r(n), we have $m \le \sum_{i=1}^{r} \binom{n}{i}$. By Corollary 23.6 in [9], $m \le 2^{nH(r/n)}$, where $H(y) = -y \log_2 y - (1 - y) \log_2(1 - y)$, the entropy of y. It is easy to see that if y = o(n)/n, then H(y) = o(n)/n. Hence, if r(n) = o(n), then $m \le 2^{o(n)}$. By Corollary 23.5 in [9] (this result was first proved by Kleitman et al. [32]), for a given $\rho = \rho(m)$ we have $m \le 2^{nH(\rho/m)}$. Therefore, if $\rho(m) = o(m)$ then $m \le 2^{n \cdot o(m)/m}$ and, thus, $m \le 2^{o(n)}$ (as $n \le m$, if $n \to \infty$ then $m \to \infty$ and $o(m)/m \to 0$). Thus, both results of Crowston et al. [5] follow from Corollary 1.

Similarly to Corollary 1 it is easy to prove the following:

Corollary 2. Let 0 < a < 1 be a constant. If $m \le 2^{O(n^a)}$ then Max Lin AA has a kernel with $O(k^{1/(1-a)})$ variables.

By Corollary 1 it is easy to show that Max Lin AA is in XP.

Proposition 2. Max Lin AA can be solved in time $O(m^{k+O(1)})$.

Proof. We may again assume $m \ge n \ge k > 1$. As in the proof of Corollary 1, if $m \le 2^{n/k}$ then the answer to Max Lin AA is YES and a solution can be found in time $m^{O(1)}$. Otherwise, $2^n < m^k$ and Max Lin AA can be solved in time $O(m^{k+2})$. □

In fact, it is possible to improve this result, as the next theorem shows. 

Theorem 3. Max Lin AA is in W[P]. 

To prove this theorem we make use of the following lemma from [7] (Lemma 3.8, p. 48). Here k(x) is the value of the parameter on an instance $x \in \Sigma^*$.

Lemma 3. A parameterized problem (Q, k) over the alphabet $\Sigma$ is in W[P] if and only if there are computable functions $f, h : \mathbb{N} \to \mathbb{N}$, a polynomial p(X), and a $Y \subseteq \Sigma^* \times \{0,1\}^*$ such that:

(i) For all $(x, y) \in \Sigma^* \times \{0,1\}^*$, it is decidable in time $f(k(x)) \cdot p(|x|)$ whether $(x, y) \in Y$.

(ii) For all $(x, y) \in \Sigma^* \times \{0,1\}^*$, if $(x, y) \in Y$ then $|y| = h(k(x)) \cdot \lfloor \log_2 |x| \rfloor$.

(iii) For every $x \in \Sigma^*$, $x \in Q$ if and only if there exists a $y \in \{0,1\}^*$ such that $(x, y) \in Y$.

Proof of Theorem 3. Recall from Lemma 2 that the maximum excess of Az = b is at least k if and only if we can run algorithm H a number of times and get a total weight of marked equations at least k.

Suppose we are given a sequence $e_1, \ldots, e_l$ of equations to mark in each iteration of H. We can, at the i'th iteration of H, mark equation $e_i$ as long as $e_i$ is still in the system. If we are able to mark all the equations $e_1, \ldots, e_l$, we can then check that the total weight of these marked equations is at least k. If it is, then we know we have a YES-instance. Conversely, if the system has a maximum excess of at least k, then there will be some sequence $e_1, \ldots, e_l$ that gives us a total weight of marked equations at least k. Furthermore, by integrality of the weights, we may assume that $l \le k$. We use this idea to construct a set Y that satisfies the conditions of Lemma 3.

Firstly we show that a sequence of $l \le k$ equations can be encoded as a string $y \in \{0,1\}^*$ of length $2k \cdot \lfloor \log_2 |x| \rfloor$, where x is an instance of Max Lin AA. Let the equations be numbered from 1 to m, then we can express a sequence of equations $e_1, \ldots, e_l$ as a sequence of k integers between 0 and m (if l < k then we end the sequence with k - l zeroes). Each integer between 0 and m can be expressed by a string in $\{0,1\}^*$ of length at most $\lceil \log_2 m \rceil \le \lceil \log_2 |x| \rceil$, so certainly it can be expressed by a string of length $2\lfloor \log_2 |x| \rfloor$. Therefore we can express the k integers as a string of length $2k \cdot \lfloor \log_2 |x| \rfloor$.

For an instance x of Max Lin AA and a string $y \in \{0,1\}^*$, let us call y a certificate for x if $|y| = 2k \cdot \lfloor \log_2 |x| \rfloor$ and y encodes a sequence of k integers corresponding to a sequence of equations $e_1, \ldots, e_l$ in x, such that by marking each equation in turn in iterations of H, we get a set of marked equations of weight at least k. It follows that x is a YES-instance if and only if there exists a certificate for x. Furthermore we can check in polynomial time whether y is a certificate of x by trying to convert y into a sequence of equations and running algorithm H marking those equations. (This is in fact a stronger result than we require for this proof: we only need that the algorithm is fixed-parameter tractable rather than polynomial.)
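The encoding of a sequence of equation numbers as a bit string of length $2k \cdot \lfloor \log_2 |x| \rfloor$ can be sketched as follows. The function names and the parameter `x_len` (standing for the instance length $|x|$, assumed to be at least 2 and at least m) are illustrative assumptions.

```python
def encode(seq, k, x_len):
    """Encode at most k equation numbers (each in 1..m) as 2k * floor(log2 |x|) bits."""
    width = 2 * (x_len.bit_length() - 1)        # 2 * floor(log2 |x|) bits per number
    padded = list(seq) + [0] * (k - len(seq))   # 0 marks an unused slot
    return "".join(format(i, "0{}b".format(width)) for i in padded)

def decode(y, k, x_len):
    """Recover the sequence of equation numbers, or None if y has the wrong length."""
    width = 2 * (x_len.bit_length() - 1)
    if len(y) != k * width:
        return None
    numbers = [int(y[j * width:(j + 1) * width], 2) for j in range(k)]
    return [i for i in numbers if i != 0]       # zeros are padding, not equations
```

Checking a candidate certificate then amounts to decoding it and running Algorithm H with the decoded equations as the Step 1 choices.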

We now let

$$Y = \{(x, y) \in \Sigma^* \times \{0,1\}^* : x \text{ is a YES-instance of Max Lin AA and } y \text{ is a certificate of } x\}$$

and let Q be the set of all YES-instances of Max Lin AA. By definition of Y and the definition of a certificate, conditions (ii) and (iii) of Lemma 3 are satisfied. As we can determine in polynomial time whether y is a certificate for x, condition (i) is also satisfied. Therefore, by Lemma 3, Max Lin AA is in W[P].

□ 



3.2 Max r-Lin AA, Max Exact r-SAT AA and Max r-CSP AA 

Using Theorem 2 we can prove the following two results.

Corollary 3. Let $r \ge 2$ be a fixed integer. Then Max r-Lin AA has a kernel with $O(k \log k)$ variables and can be solved in time $2^{O(k \log k)} + m^{O(1)}$.

Proof. Observe that $m \le n^r$ and $n^r \le 2^{n/(k-1)} - 2$ if $n \ge c(r) k \log_2 k$ provided c(r) is large enough (c(r) depends only on r). Thus, by Theorem 2, if $n \ge c(r) k \log_2 k$ then the answer to Max r-Lin AA is YES. Hence, we obtain a problem kernel with at most $c(r) k \log_2 k = O(k \log k)$ variables and, therefore, can solve Max r-Lin AA in time $2^{O(k \log k)} + m^{O(1)}$. □
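The inequality $n^r \le 2^{n/(k-1)} - 2$ used in this proof can be explored numerically. The sketch below tests it in exact integer arithmetic and reports the first n at which it holds; it does not prove that the inequality persists for all larger n, and the function names are illustrative assumptions.

```python
def theorem2_applies(n, r, k):
    """Exact test of n^r <= 2^{n/(k-1)} - 2, rewritten as (n^r + 2)^(k-1) <= 2^n (k >= 2)."""
    return (n ** r + 2) ** (k - 1) <= 2 ** n

def first_n(r, k):
    """Smallest n >= 1 at which the inequality holds for the given r and k."""
    n = 1
    while not theorem2_applies(n, r, k):
        n += 1
    return n
```

For r = 2 the threshold grows from 5 at k = 2 to 17 at k = 3, in line with the $k \log k$ behaviour stated in the corollary.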

Corollary 4. Let $r \ge 2$ be a fixed integer. Then there is a bikernel from Max Exact r-SAT to Max r-Lin AA with $O(k \log k)$ variables. Moreover, Max Exact r-SAT has a kernel with $O(k \log k)$ variables and can be solved in time $2^{O(k \log k)} + m^{O(1)}$.

Proof. Let F be an r-CNF formula with clauses $C_1, \ldots, C_m$ in the variables $x_1, x_2, \ldots, x_n$. We may assume that $x_i \in \{-1, 1\}$, where -1 corresponds to true. For F, following [1], consider

$$g(x) = \sum_{C \in F} \Big[ 1 - \prod_{x_i \in \mathrm{var}(C)} (1 + \epsilon_i x_i) \Big],$$

where var(C) is the set of variables of C, $\epsilon_i \in \{-1, 1\}$ and $\epsilon_i = 1$ if and only if $x_i$ is in C. It is shown in [1] that the answer to Max Exact r-SAT is YES if and only if there is a truth assignment $x^0$ such that $g(x^0) \ge k$.

Algebraic simplification of g(x) will lead us to the Fourier expansion of g(x):

$$g(x) = \sum_{S \in \mathcal{F}} c_S \prod_{i \in S} x_i, \qquad (3)$$

where $\mathcal{F} = \{\emptyset \ne S \subseteq [n] : c_S \ne 0, |S| \le r\}$. Thus, $|\mathcal{F}| \le n^r$. By Proposition 1, $\sum_{S \in \mathcal{F}} c_S \prod_{i \in S} x_i$ can be viewed as an instance of Max r-Lin and, thus, we can reduce Max Exact r-SAT into Max r-Lin in polynomial time (the algebraic simplification can be done in polynomial time as r is fixed). By Corollary 3, Max r-Lin has a kernel with $O(k \log k)$ variables. This kernel is a bikernel from Max Exact r-SAT to Max r-Lin. Using this bikernel, we can solve Max Exact r-SAT in time $2^{O(k \log k)} + m^{O(1)}$.

It remains to use the transformation described in [1] of a bikernel from Max Exact r-SAT to Max r-Lin into a kernel of Max Exact r-SAT. This transformation gives us a kernel with $O(k \log k)$ variables. □

In the Boolean Max-r-Constraint Satisfaction Problem (Max-r-CSP), we are given a collection of Boolean functions, each involving at most r variables, and asked to find a truth assignment that satisfies as many functions as possible. We will consider the following parameterized version of Max-r-CSP. We are given a set $\Phi$ of Boolean functions, each involving at most r variables, and a collection $\mathcal{F}$ of m Boolean functions, each $f \in \mathcal{F}$ being a member of $\Phi$, and each acting on some subset of the n Boolean variables $x_1, x_2, \ldots, x_n$ (each $x_i \in \{-1, 1\}$). We are to decide whether there is a truth assignment to the n variables such that the total number of satisfied functions is at least $E + k2^{-r}$, where E is the average value of the number of satisfied functions.

Corollary 5. Let $r \ge 2$ be a fixed integer. Then there is a bikernel from Max r-CSP to Max r-Lin AA with $O(k \log k)$ variables. Max r-CSP can be solved in time $2^{O(k \log k)} + m^{O(1)}$.

Proof. Following [2], for a boolean function f of $r(f) \le r$ boolean variables $x_{i_1}, \ldots, x_{i_{r(f)}}$, introduce a polynomial $h_f(x)$, $x = (x_1, x_2, \ldots, x_n)$, as follows. Let $V_f \subseteq \{-1, 1\}^{r(f)}$ denote the set of all satisfying assignments of f. Then

$$h_f(x) = 2^{r - r(f)} \sum_{(v_1, \ldots, v_{r(f)}) \in V_f} \Big[ \prod_{j=1}^{r(f)} (1 + x_{i_j} v_j) - 1 \Big].$$

Let $h(x) = \sum_{f \in \mathcal{F}} h_f(x)$. It is easy to see (cf. [1]) that the value of h(x) at $x^0$ is precisely $2^r(s - E)$, where s is the number of the functions satisfied by the truth assignment $x^0$, and E is the average value of the number of satisfied functions. Thus, the answer to Max-r-CSP is YES if and only if there is a truth assignment $x^0$ such that $h(x^0) \ge k$. The rest of the proof is similar to that of Corollary 4. □

3.3 Lower Bound on Maxima of Pseudo-boolean Functions 

Corollary 6. We have $\max_{x \in \{-1,+1\}^n} f(x) \ge \hat{f}(\emptyset) + \left(1 + \left\lfloor \frac{\mathrm{rank}\, A}{\log_2(|\mathcal{F}| + 2)} \right\rfloor\right) \cdot \min_{S \in \mathcal{F}} |\hat{f}(S)|$.

Proof. Consider the system Az = b associated with the Fourier expansion of f according to the bijection described before Proposition 1. We may assume that the weighted system Az = b has been simplified using Rule 1 and, thus, its number n' of variables equals rank A. Note that $n' \le m$, where m is the number of equations in Az = b. By Theorem 2, Proposition 1 and the fact that $\min_{S \in \mathcal{F}} |\hat{f}(S)| = \min_j w_j$, it follows that if $k \le m \le 2^{n'/(k-1)} - 2$ then

$$\max_{x \in \{-1,+1\}^n} f(x) - \hat{f}(\emptyset) \ge k \min_{S \in \mathcal{F}} |\hat{f}(S)|.$$

To complete the proof, recall that n' = rank A, $m = |\mathcal{F}|$ and observe that the maximum possible (integral) value of k satisfying $m \le 2^{n'/(k-1)} - 2$ is $1 + \left\lfloor \frac{\mathrm{rank}\, A}{\log_2(|\mathcal{F}| + 2)} \right\rfloor$.

This bound is tight. Indeed, consider the function $f(x) = -\sum_{\emptyset \ne S \subseteq [n]} \prod_{i \in S} x_i$. Observe that n = rank A, $|\mathcal{F}| = 2^n - 1$ and, thus, $\max_{x \in \{-1,+1\}^n} f(x) \ge 1 + \left\lfloor \frac{n}{\log_2(|\mathcal{F}| + 2)} \right\rfloor = 1$. If x = (1, 1, ..., 1) then $f(x) = -|\mathcal{F}|$ and if we set some $x_i = -1$ then after canceling out of monomials we see that f(x) = 1. Therefore, $\max_{x \in \{-1,+1\}^n} f(x) = 1$ and, thus, the bound of Corollary 6 is tight. It is easy to see that the bound remains tight if we delete one monomial from f(x). A slightly more complicated function showing that the bound is tight is as follows: $g(x) = -\sum_{\emptyset \ne S \subseteq [n_1]} \prod_{i \in S} x_i - \sum_{S \in \mathcal{G}} \prod_{i \in S} x_i$, where $n_1 < n$ and $\mathcal{G} = \{S : \emptyset \ne S \subseteq [n], [n_1] \cap S = \emptyset\}$.
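The tightness example can be verified by direct enumeration for a small n. The sketch below evaluates the first function of the example (all nonempty monomials with coefficient -1) and the multiplier $1 + \lfloor \mathrm{rank}\,A / \log_2(|\mathcal{F}| + 2) \rfloor$ from Corollary 6; the function names and the choice n = 3 are illustrative assumptions.

```python
from itertools import combinations, product
from math import floor, log2, prod

def bound_multiplier(rank, num_coeffs):
    """The factor 1 + floor(rank / log2(|F| + 2)) from Corollary 6."""
    return 1 + floor(rank / log2(num_coeffs + 2))

def f(x):
    """f(x) = - sum over all nonempty S of prod_{i in S} x_i."""
    n = len(x)
    return -sum(prod(x[i] for i in S)
                for r in range(1, n + 1)
                for S in combinations(range(n), r))

n = 3
best = max(f(x) for x in product((1, -1), repeat=n))
```

Here the constant term is 0 and every nonzero coefficient has absolute value 1, so the bound of Corollary 6 equals the multiplier itself, which matches the maximum found by enumeration.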

Remark 3. Consider Max Lin with irreducible system Az = b in which every equation is of weight 1. Then the bound of Theorem 2 gives a $(1/2 + (1 + \delta)/m)$-approximation for Max Lin, where $\delta = \lfloor n/\log_2(m + 2) \rfloor$. This is of interest since by the result of Håstad mentioned in Section 1 a $(1/2 + \epsilon)$-approximation is impossible for any constant $\epsilon > 0$ unless P=NP.